On Classifying Discussion Threads Using Travel Information Goal-Oriented Model

نویسندگان

  • San-Yih Hwang
  • Roger H. L. Chiang
  • Yung-Lin Hsiao
  • Shanlin Chang
چکیده

We study how to recommend discussion threads in the tourism domain to meet visitors’ travel information needs. This research-in-progress paper reports the first stage of our research, namely classifying discussion threads into travel goals. We propose an information goal-oriented model, which consists of four goals: Initiation, Attraction, Accommodation, and Route planning, that can be characterized using nine features. Seven of these nine features can be quantified based on lexicons, and the other two can be measured using the named entity recognition technique. Three lexicons can be further enhanced using WordNet. We conduct an experiment in evaluating the impact of these features on goal classification with a data set collected from TripAdvisor.com, the world's largest travelling website. The experimental results show that our approach generally has comparable or better performance than that of using purely lexical features, namely TF-IDF, for classification.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Modeling Threaded Discussions using Induced Ontology Knowledge

Online discussion boards are a popular form of web-based computer-mediated communication, especially in the areas of distributed education and customer support. Automatic analysis for discussion understanding would enable better information assessment and assistance. This paper describes an extensive study of the relationship between individual messages and full discussion threads. We present a...

متن کامل

Automatic Thread Classification for Linux User Forum Information Access

We experiment with text classification of threads from Linux web user forums, in the context of improving information access to the problems and solutions described in the threads. We specifically focus on classifying threads according to: (1) them describing a specific problem vs. containing a more general discussion; (2) the completeness of the initial post in the thread; and (3) whether prob...

متن کامل

An enhanced reliability-oriented workforce planning model for process industry using combined fuzzy goal programming and differential evolution approach

This paper draws on the “human reliability” concept as a structure for gaining insight into the maintenance workforce assessment in a process industry. Human reliability hinges on developing the reliability of humans to a threshold that guides the maintenance workforce to execute accurate decisions within the limits of resources and time allocations. This concept offers a worthwhile point of de...

متن کامل

Generating Supplementary Travel Guides from Social Media

In this paper we study how to summarize travel-related information in forum threads to generate supplementary travel guides. Such summaries presumably can provide additional and more up-to-date information to tourists. Existing multi-document summarization methods have limitations for this task because (1) they do not generate structured summaries but travel guides usually follow a certain temp...

متن کامل

Online discussion groups for bulimia nervosa: an inductive approach to Internet-based communication between patients.

OBJECTIVE Online discussion forums are often used by people with eating disorders. METHOD This study analyses 2,072 threads containing a total of 14,903 postings from an unmoderated German "prorecovery" forum for persons suffering from bulimia nervosa (www.ab-server.de) during the period from October 2004 to May 2006. The threads were inductively analyzed for underlying structural types, and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013